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IDENTIFICATION OF BROADLY REACTIVE DR RESTRICTED EPITOPES 

CROSS-REFERENCE TO RELATED APPLICATIONS 

The present application is a continuation in part of USSN 60/036,713, filed 
January 23, 1997 and 60/037,432 filed February 7, 1997, both of which are incorporated 
herein by reference. 

BACKGROUND OF THE INVENTION 

Helper T lymphocytes (HTL) play several important functions in immunity 
to pathogens. Firstly, they provide help for induction of both CTL and antibody 
responses. By both direct contact and by secreting lymphokines such as IL2 and IL4, 
HTL promote and support the expansion and differentiation of T and B cell precursors into 
effector cells. In addition, HTL can also be effectors in their own right, an activity also 
mediated by direct cell contact and secretion of lymphokines, such as IFNy and TNFa. 
HTL have been shown to have direct effector activity in case of tumors, as well as viral, 
bacterial, parasitic, and fungal infections. 

HTL recognize a complex formed between Class II MHC molecules and 
antigenic peptides, usually between 10 and 20 residues long, and with an average size of 
between 13 and 16 amino acids. Peptide-Class II interactions have been analyzed in 
detail, both at the structural and functional level, and peptide motifs specific for various 
human and mouse Class II molecules have been proposed. 

In the last few years, epitope based vaccines have received considerable 
attention as a possible mean to develop novel prophylactic vaccines and immunotherapeutic 
strategies. Selection of appropriate T and B cell epitopes should allow to focus the 
immune system toward conserved epitopes of pathogens which are characterized by high 
sequence variability (such as HIV, HCV and Malaria). 

In addition, focusing the immune response towards selected determinants 
could be of value in the case of various chronic viral diseases and cancer, where T cells 
directed against the immunodominant epitopes might have been inactivated while T cells 
specific for subdominant epitopes might have escaped T cell tolerance. The use of epitope 
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based vaccines also allows to avoid "suppressive" T cell determinants which induce TH 2 
responses, in conditions where a THj response is desirable, or vice versa. 

Finally, epitope based vaccines also offer the opportunity to include in the 
vaccine construct epitopes that have been engineered to modulate their potency, either by 
5 increasing MHC binding affinity, or by alteration of its TCR contact residues, or both. 
Inclusion of completely synthetic non-natural or generically unrelated to the pathogen 
epitopes (such as TT derived "universal" epitopes), also represents a possible mean of 
modulating the HTL response toward a TH,, or TH 2 phenotype. 

Once appropriate epitope determinants have been defined, they can be 
10 assorted and delivered by various means, which include lipopeptides, viral delivery 
vectors, particles of viral or synthetic origin, naked or particle absorbed cDNA. 

However, before appropriate epitopes can be defined, one major obstacle 
has to be overcome, namely the very high degree of polymorphism of the MHC molecules 
expressed in the human population. In fact, more than two hundred different types of 
15 HLA Class I and Class II molecules have already been identified. It has been 

demonstrated that in the case of HLA Class I molecules, peptides capable of binding 
several different HLA Class I molecules can be identified. Over 60 % of the known HLA 
Class I molecules can, in fact, be grouped in four broad HLA supertypes, characterized by 
similar peptide binding specificities (HLA supennotifs). 
20 In the case of Class III molecules, it is also known that peptides capable of 

binding multiple HLA types and of being immunogenic in the context of different HLA 
molecules do indeed exist. Until now, however, a general method for their identification 
has not been developed, probably at least in part a reflection of the fact that quantitative 
DR binding assays are labor intensive and that a large number of alleles must to be 
25 considered. 

The present invention addresses these and other needs. 
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SUMMARY OF THE INVENTION 

The present invention is based, at least in part, on the discovery and 
validation of specific motifs and assay systems for various DR molecules, representative of 
the worldwide population. Their application to the identification of broadly degenerate 
HLA Class II binding peptides is also described. 

Definitions 

The term "peptide" is used interchangeably with "oligopeptide" in the 
present specification to designate a series of residues, typically L-amino acids, connected 
one to the other typically by peptide bonds between the alpha-amino and carbonyl groups 
of adjacent amino acids. The oligopeptides of the invention are less than about 50 residues 
in length and usually consist of between about 10 and about 30 residues, more usually 
between about 12 and 25, and often between about 15 and about 20 residues. 

An "immunogenic peptide" is a peptide which comprises an allele-specific 
motif such that the peptide will bind an MHC molecule and induce a HTL response. 
Immunogenic peptides of the invention are capable of binding to an appropriate HLA 
molecule and inducing HTL response against the antigen from which the immunogenic 
peptide is derived. 

A "conserved residue" is a conserved amino acid occupying a particular 
position in a peptide motif typically one where the MHC structure may provide a contact 
point with the immunogenic peptide. One to three, typically two, conserved residues 
within a peptide of defined length defines a motif for an immunogenic peptide. These 
residues are typically in close contact with the peptide binding groove, with their side 
chains buried in specific pockets of the groove itself. 

The term "motif" refers to the pattern of residues of defined length, usually 
between about 8 to about 11 amino acids, which is recognized by a particular MHC allele. 

The term "supermotif " refers to motifs that, when present in an 
immunogenic peptide, allow the peptide to bind more than one HLA antigen. The 
supermotif preferably is recognized by at least one HLA allele having a wide distribution 
in the human population, preferably recognized by at least two alleles, more preferably 
recognized by at least three alleles, and most preferably recognized by more than three 
alleles. 
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The phrases "isolated" or "biologically pure" refer to material which is 
substantially or essentially free from components which normally accompany it as found in 
its native state. Thus, the peptides of this invention do not contain materials normally 
associated with their in situ environment, e.g., MHC I molecules on antigen presenting 
5 cells. Even where a protein has been isolated to a homogenous or dominant band, there 
are trace contaminants in the range of 5-10% of native protein which co-purify with the 
desired protein. Isolated peptides of this invention do not contain such endogenous co- 
purified protein. 

The term "residue" refers to an amino acid or amino acid mimetic 
10 incorporated in an oligopeptide by an amide bond or amide bond mimetic. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 presents a map of the positive or negative effect of each of the 20 
naturally occurring amino acids on DR4w4 binding capacity when occupying a particular 
15 position, relative to the main P1-P6 anchors. 

Figure 2A presents a map of the positive or negative effect of each of the 20 
naturally occurring amino acids on DR1 binding capacity when occupying a particular 
position, relative to the main P1-P6 anchors. 

Figure 2B presents a map of the positive or negative effect of each of the 20 
20 naturally occurring amino acids on DR7 binding capacity when occupying a particular 
position, relative to the main P1-P6 anchors. 

DESCRIPTION OF THE PREFERRED EMBODIMENT 

The present invention relates to compositions and methods for preventing, 
25 treating or diagnosing a number of pathological states such as viral, fungal, bacterial and 
parasitic diseases and cancers. In particular, it provides novel peptides capable of binding 
selected major histocompatibility complex (MHC) class II molecules and inducing an 
immune response. 

Peptide binding to MHC molecules is determined by the allelic type of the 
30 MHC molecule and the amino acid sequence of the peptide. MHC class I-binding peptides 
usually contain within their sequence two conserved ("anchor") residues that interact with 
corresponding binding pockets in the MHC molecule. Specific combination of anchor 
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residues (usually referred to as "MHC motifs") required for binding by several allelic 
forms of human MHC (HLA, histocompatibility leukocyte antigens) are described in 
International Applications WO 94/03205 and WO 94/20127. Definition of specific MHC 
motifs allows one to predict from the amino acid sequence of an individual protein, which 
5 peptides have the potential of being immunogenic for CTL. These applications describe 
methods for preparation and use of immunogenic peptides in the treatment of disease. The 
peptides described here can also be used as helper T peptides in combination with peptides 
which induce a CTL response. This is described in WO 95/07077. 

The DR-binding peptides of the present invention or nucleic acids encoding 

10 them can be administered to mammals, particularly humans, for prophylactic and/or 

therapeutic purposes. The DR peptides can be used to enhance immune responses against 
other immunogens administered with the peptides, when the peptides of the invention are 
used as helper peptides. For instance, mixtures of peptides of the invention in 
combination with peptides that induce CTL responses may be used to treat and/or prevent 

15 viral infection and cancer. Alternatively, immunogens which induce antibody responses 
can be used. Examples of diseases which can be treated using the immunogenic mixtures 
of DR peptides and other immunogens include prostate cancer, hepatitis B, hepatitis C, 
AIDS, renal carcinoma, cervical carcinoma, lymphoma, CMV and condyloma 
acuminatum. 

20 The DR-binding peptides or nucleic acids encoding them may also be used 

to treat a variety of conditions involving unwanted T cell reactivity. Examples of diseases 
which can be treated using DR-binding peptides include autoimmune diseases (e.g., 
rheumatoid arthritis, multiple sclerosis, and myasthenia gravis), allograft rejection, 
allergies (e.g., pollen allergies), lyme disease, hepatitis, LCMV, post-streptococcal 

25 endocarditis, or glomerulonephritis, and food hypersensitivities. 

In therapeutic applications, the immunogenic compositions or the DR- 
binding peptides or nucleic acids of the invention are administered to an individual already 
suffering from cancer, autoimmune disease, or infected with the virus of interest. Those 
in the incubation phase or the acute phase of the disease may be treated with the DR- 

30 binding peptides or immunogenic conjugates separately or in conjunction with other 
treatments, as appropriate. 
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In therapeutic applications, compositions comprising immunogenic 
compositions are administered to a patient in an amount sufficient to elicit an effective 
immune response to the virus or tumor antigen and to cure or at least partially arrest 
symptoms and/or complications. Similarly, compositions comprising DR-binding peptides 
5 are administered in an amount sufficient to cure or at least partially arrest the symptoms of 
the disease and its complications. An amount adequate to accomplish this is defined as 
"therapeutically effective dose." Amounts effective for this use will depend on, e.g., the 
peptide composition, the manner of administration, the stage and severity of the disease 
being treated, the weight and general state of health of the patient, and the judgment of the 

10 prescribing physician. 

Therapeutically effective amounts of the immunogenic compositions of the 
present invention generally range for the initial immunization (that is for therapeutic or 
prophylactic administration) from about 1.0- fig to about 10,000 fig of peptide for a 70 kg 
patient, usually from about 100 to about 8000 fig, and preferably between about 200 and 

15 about 6000 fig. These doses are followed by boosting dosages of from about 1.0 fig to 

about 1000 fig of peptide pursuant to a boosting regimen over weeks to months depending 
upon the patient's response and condition by measuring specific immunogenic activity in 
the patient's blood. 

It must be kept in mind that the compositions of the present invention may 

20 generally be employed in serious disease states, that is, life-threatening or potentially life- 
threatening situations. In such cases, in view of the minimization of extraneous substances 
and the relative nontoxic nature of the conjugates, it is possible and may be felt desirable 
by the treating physician to administer substantial excesses of these compositions. 

For prophylactic use, administration should be given to risk groups. For 

25 example, protection against malaria, hepatitis, or AIDS may be accomplished by 

prophylactically administering compositions of the invention, thereby increasing immune 
capacity. Therapeutic administration may begin at the first sign of disease or the detection 
or surgical removal of tumors or shortly after diagnosis in the case of acute infection. 
This is followed by boosting doses until at least symptoms are substantially abated and for 

30 a period thereafter. In chronic infection, loading doses followed by boosting doses may be 
required. 
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Treatment of an infected individual with the compositions of the invention 
may hasten resolution of the infection in acutely infected individuals. For those 
individuals susceptible (or predisposed) to developing chronic infection the compositions 
are particularly useful in methods for preventing the evolution from acute to chronic 

5 infection. Where the susceptible individuals are identified prior to or during infection, for 
instance, as described herein, the composition can be targeted to them, minimizing need 
for administration to a larger population. 

The peptide mixtures or conjugates can also be used for the treatment of 
chronic infection and to stimulate the immune system to eliminate virus-infected cells in 

10 carriers. It is important to provide an amount of immuno-potentiating peptide in a 

formulation and mode of administration sufficient to effectively stimulate a cytotoxic T 
cell response. Thus, for treatment of chronic infection, a representative dose is in the 
range of about 1.0 fig to about 5000 /ig, preferably about 5 fig to 1000 fig for a 70 kg 
patient per dose. Immunizing doses followed by boosting doses at established intervals, 

15 e.g., from one to four weeks, may be required, possibly for a prolonged period of time to 
effectively immunize an individual. In the case of chronic infection, administration should 
continue until at least clinical symptoms or laboratory tests indicate that the viral infection 
has been eliminated or substantially abated and for a period thereafter. 

The pharmaceutical compositions for therapeutic or prophylactic treatment 

20 are intended for parenteral, topical, oral or local administration. Typically, the 
pharmaceutical compositions are administered parenterally, e.g., intravenously, 
subcutaneously, intradermal!/, or intramuscularly. Because of the ease of administration, 
the vaccine compositions of the invention are particularly suitable for oral administration. 
Thus, the invention provides compositions for parenteral administration which comprise a 

25 solution of the peptides or conjugates dissolved or suspended in an acceptable carrier, 
preferably an aqueous carrier. A variety of aqueous carriers may be used, e.g., water, 
buffered water, 0.9% saline, 0.3% glycine, hyaluronic acid and the like. These 
compositions may be sterilized by conventional, well known sterilization techniques, or 
may be sterile filtered. The resulting aqueous solutions may be packaged for use as is, or 

30 lyophilized, the lyophilized preparation being combined with a sterile solution prior to 
administration. The compositions may contain pharmaceutical^ acceptable auxiliary 
substances as required to approximate physiological conditions, such as pH adjusting and 
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buffering agents, tonicity adjusting agents, wetting agents and the like, for example, 
sodium acetate, sodium lactate, sodium chloride, potassium chloride, calcium chloride, 
sorbitan monolaurate, triethanolamine oleate, etc. 

The concentration of DR and/or CTL stimulatory peptides of the invention 
in the pharmaceutical formulations can vary widely, i.e., from less than about 0.1%, 
usually at or at least about 2% to as much as 20% to 50% or more by weight, and will be 
selected primarily by fluid volumes, viscosities, etc., in accordance with the particular 
mode of administration selected. 

The peptides and conjugates of the invention may also be administered via 
liposomes, which serve to target the conjugates to a particular tissue, such as lymphoid 
tissue, or targeted selectively to infected cells, as well as increase the half-life of the 
peptide composition. Liposomes include emulsions, foams, micelles, insoluble 
monolayers, liquid crystals, phospholipid dispersions, lamellar layers and the like. In 
these preparations the peptide to be delivered is incorporated as part of a liposome, alone 
or in conjunction with a molecule which binds to, e.g., a receptor prevalent among 
lymphoid cells, such as monoclonal antibodies which bind to the CD45 antigen, or with 
other therapeutic or immunogenic compositions. Thus, liposomes fdled with a desired 
peptide or conjugate of the invention can be directed to the site of lymphoid cells, where 
the liposomes then deliver the selected therapeutic/immunogenic peptide compositions. 
Liposomes for use in the invention are formed from standard vesicle-forming lipids, which 
generally include neutral and negatively charged phospholipids and a sterol, such as 
cholesterol. The selection of lipids is generally guided by consideration of, e.g., liposome 
size, acid lability and stability of the liposomes in the blood stream. A variety of methods 
are available for preparing liposomes, as described in, e.g., Szoka, et aL, Ann. Rev. 
Biophys. Bioeng. 9, 467 (1980), U.S. Patent Nos. 4,235,871, 4,501,728, 4,837,028, and 
5,019,369, incorporated herein by reference. 

For targeting to the immune cells, a ligand to be incorporated into the 
liposome can include, e.g. , antibodies or fragments thereof specific for cell surface 
determinants of the desired immune system cells. A liposome suspension containing a 
peptide or conjugate may be administered intravenously, locally, topically, etc. in a dose 
which varies according to, inter alia, the manner of administration, the conjugate being 
delivered, and the stage of the disease being treated. 
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Alternatively, DNA or RNA encoding one or more DR peptides and a 
polypeptide containing one or more CTL epitopes or antibody inducing epitopes may be 
introduced into patients to obtain an immune response to the polypeptides which the 
nucleic acid encodes. Wolff, et. al., Science 247: 1465-1468 (1990) describes the use of 
5 nucleic acids to produce expression of the genes which the nucleic acids encode. Such use 
is also disclosed in U.S. Patent Nos. 5,580,859 and 5,589,466. The nucleic acids can also 
be administered using ballistic delivery as described, for instance, in U.S. Patent No. 
5,204,253. Particles comprised solely of DNA can be administered. Alternatively, DNA 
can be adhered to particles, such as gold particles. The nucleci acids can also be delivered 

10 complexed to cationic compounds, such as cationic lipids. Lipid-mediated gene delivery 
methods are described, for instance, in WO 96/18372; WO 93/24640; Mannino and 
Gould-Fogerite (1988) BioTechniques 6(7): 682-691; Rose U.S. Pat No. 5,279,833; WO 
91/06309; and Feigner etal. (1987) Proc. Natl Acad, ScL USA 84: 7413-7414. The 
peptides of the invention can also be expressed by attenuated viral hosts, such as vaccinia 

15 or fowlpox. This approach involves the use of vaccinia virus as a vector to express 

nucleotide sequences that encode the peptides of the invention. Upon introduction into an 
acutely or chronically infected host or into a noninfected host, the recombinant vaccinia 
virus expresses the immunogenic peptide, and thereby elicits a host CTL response. 
Vaccinia vectors and methods useful in immunization protocols are described in, e.g., U.S. 

20 Patent No. 4,722,848, incorporated herein by reference. Another vector is BCG (Bacille 
Calmette Guerin). BCG vectors are described in Stover et al. (Nature 351:456-460 
(1991)) which is incorporated herein by reference. A wide variety of other vectors useful 
for therapeutic administration or immunization of the peptides of the invention, e.g., 
Salmonella typhi vectors and the like, will be apparent to those skilled in the art from the 

25 description herein. 

A preferred means of administering nucleic acids encoding the peptides of 
the invention uses minigene constructs encoding multiple peptides of the invention along 
with CTL inducing peptides. To create a DNA sequence encoding the selected DR 
peptides and CTL epitopes for expression in human cells, the amino acid sequences of the 

30 epitopes are reverse translated. A human codon usage table is used to guide the codon 
choice for each amino acid. These epitope-encoding DNA sequences are directly 
adjoined, creating a continuous polypeptide sequence. To optimize expression and/or 
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immunogenicity, additional elements can be incorporated into the minigene design. 
Examples of amino acid sequence that could be reverse translated and included in the 
minigene sequence include: DR peptides of the invention, a leader (signal) sequence, one 
or more CTL epitope, and an endoplasmic reticulum retention signal. In addition, MHC 
5 presentation of CTL epitopes may be improved by including synthetic (e.g. poly-alanine) 
or naturally -occurring flanking sequences adjacent to the CTL epitopes. 

The minigene sequence is converted to DNA by assembling oligonucleotides 
that encode the plus and minus strands of the minigene. Overlapping oligonucleotides (30- 
100 bases long) are synthesized, phosphorylated, purified and annealed under appropriate 

10 conditions using well known techniques, he ends of the oligonucleotides are joined using 
T4 DNA ligase. This synthetic minigene, encoding the CTL epitope polypeptide, can then 
cloned into a desired expression vector. 

Standard regulatory sequences well known to those of skill in the art are 
included in the vector to ensure expression in the target cells. Several vector elements are 

15 required: a promoter with a down-stream cloning site for minigene insertion; a 
polyadenylation signal for efficient transcription termination; an E. coli origin of 
replication; and an£. coli selectable marker (e.g. ampicillin or kanamycin resistance). 
Numerous promoters can be used for this purpose, e.g., the human cytomegalovirus 
(hCMV) promoter. See, U.S. Patent Nos. 5,580,859 and 5,589,466 for other suitable 

20 promoter sequences. 

Additional vector modifications may be desired to optimize minigene 
expression and immunogenicity. In some cases, introns are required for efficient gene 
expression, and one or more synthetic or naturally-occurring introns could be incorporated 
into the transcribed region of the minigene. The inclusion of mRNA stabilization 

25 sequences can also be considered for increasing minigene expression. It has recently been 
proposed that immunostimulatory sequences (ISSs or CpGs) play a role in the 
immunogenicity of DNA vaccines. These sequences could be included in the vector, 
outside the minigene coding sequence, if found to enhance immunogenicity. 

In some embodiments, a bicistronic expression vector, to allow production 

30 of the minigene-encoded epitopes and a second protein included to enhance or decrease 

immunogenicity can be used. Examples of proteins or polypeptides that could beneficially 
enhance the immune response if co-expressed include cytokines (e.g., IL2,TL12, GM- 
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CSF), cytokine-inducing molecules (e.g. LeIF) or costiraulatory molecules. The HTL 
epitopes of the invention could be joined to intracellular targeting signals and expressed 
separately from the CTL epitopes. This would allow direction of the HTL epitopes to a 
cell compartment different than the CTL epitopes. If required, this could facilitate more 
efficient entry of HTL epitopes into the MHC class II pathway, thereby improving CTL 
induction. In contrast to CTL induction, specifically decreasing the immune response by 
co-expression of immunosuppressive molecules (e.g. TGF-p) may be beneficial in certain 
diseases. 

Once an expression vector is selected, the minigene is cloned into the 
polylinker region downstream of the promoter. This plasmid is transformed into an 
appropriate E. coli strain, and DNA is prepared using standard techniques. The 
orientation and DNA sequence of the minigene, as well as all other elements included in 
the vector, are confirmed using restriction mapping and DNA sequence analysis. Bacterial 
cells harboring the correct plasmid can be stored as a master cell bank and a working cell 
bank. 

Therapeutic quantities of plasmid DNA are produced by fermentation in E. 
coli, followed by purification. Aliquots from the working cell bank are used to inoculate 
fermentation medium (such as Terrific Broth), and grown to saturation in shaker flasks or 
a bioreactor according to well known techniques. Plasmid DNA can be purified using 
standard bioseparation technologies such as solid phase anion-exchange resins supplied by 
Quiagen. If required, supercoiled DNA can be isolated from the open circular and linear 
forms using gel electrophoresis or other methods. 

Purified plasmid DNA can be prepared for injection using a variety of 
formulations. The simplest of these is reconstitution of lyophilized DNA in sterile 
phosphate-buffer saline (PBS). A variety of methods have been described, and new 
techniques may become available. As noted above, nucleic acids are conveniently 
formulated with cationic lipids. In addition, glycolipids, fusogenic liposomes, peptides 
and compounds referred to collectively as protective, interactive, non-condensing (PINC) 
could also be complexed to purified plasmid DNA to influence variables such as stability, 
intramuscular dispersion, or trafficking to specific organs or cell types. 

Target cell sensitization can be used as a functional assay for expression and 
MHC class I presentation of minigene-encoded CTL epitopes. The plasmid DNA is 
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introduced into a mammalian cell line that is suitable as a target for standard CTL 
chromium release assays. The transfection method used will be dependent on the final 
formulation. Electroporation can be used for "naked" DNA, whereas cationic lipids allow 
direct in vitro transfection. A plasmid expressing green fluorescent protein (GFP) can be 
5 co-transfected to allow enrichment of transfected cells using fluorescence activated cell 
sorting (FACS). These cells are then chromium-51 labeled and used as target cells for 
epitope-specific CTL lines. Cytolysis, detected by 51Cr release, indicates production of 
MHC presentation of minigene-encoded CTL epitopes. 

In vivo immunogenicity is a second approach for functional testing of 

10 minigene DNA formulations. Transgenic mice expressing appropriate human MHC 

molecules are immunized with the DNA product. The dose and route of administration 
are formulation dependent (e.g. IM for DNA in PBS, IP for lipid-complexed DNA). 
Twenty-one days after immunization, splenocytes are harvested and restimulated for 1 
week in the presence of peptides encoding each epitope being tested. These effector cells 

15 (CTLs) are assayed for cytolysis of peptide-loaded, chromium-51 labeled target cells using 
standard techniques. Lysis of target cells sensitized by MHC loading of peptides 
corresponding to minigene-encoded epitopes demonstrates DNA vaccine function for in 
vivo induction of CTLs. 

For solid compositions, conventional nontoxic solid carriers may be used 

20 which include, for example, pharmaceutical grades of mannitol, lactose, starch, 

magnesium stearate, sodium saccharin, talcum, cellulose, glucose, sucrose, magnesium 
carbonate, and the like. For oral administration, a pharmaceutical^ acceptable nontoxic 
composition is formed by incorporating any of the normally employed excipients, such as 
those carriers previously listed, and generally 10-95% of active ingredient, that is, one or 

25 more conjugates of the invention, and more preferably at a concentration of 25%-75%. 

For aerosol administration, the peptides are preferably supplied in finely 
divided form along with a surfactant and propellant. Typical percentages of conjugates are 
0.01%-20% by weight, preferably 1%-10%. The surfactant must, of course, be nontoxic, 
and preferably soluble in the propellant. Representative of such agents are the esters or 

30 partial esters of fatty acids containing from 6 to 22 carbon atoms, such as caproic, 

octanoic, lauric, palmitic, stearic, linoleic, linolenic, olesteric and oleic acids with an 
aliphatic polyhydric alcohol or its cyclic anhydride. Mixed esters, such as mixed or 
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natural glycerides may be employed. The surfactant may constitute 0.1%-20% by weight 
of the composition, preferably 0.25-5%. The balance of the composition is ordinarily 
propellant. A carrier can also be included, as desired, as with, e.g. , lecithin for intranasal 
delivery. 

5 In another aspect the present invention is directed to vaccines which contain 

as an active ingredient an immunogenically effective amount of an immunogenic DR 
peptide or a CTIADR peptide conjugate or nucleic acid encoding them as described herein. 
The conjugate(s) may be introduced into a host, including humans, linked to its own 
carrier or as a homopolymer or heteropolymer of active peptide units. Such a polymer has 

10 the advantage of increased immunological reaction and, where different peptides are used 
to make up the polymer, the additional ability to induce antibodies and/or CTLs that react 
with different antigenic determinants of the virus or tumor cells. Useful carriers are well 
known in the art, and include, e.g., thyroglobulin, albumins such as bovine serum 
albumin, tetanus toxoid, polyamino acids such as poly(lysine: glutamic acid), hepatitis B 

15 virus core protein, hepatitis B virus recombinant vaccine and the like. The vaccines can 
also contain a physiologically tolerable (acceptable) diluent such as water, phosphate 
buffered saline, or saline, and further typically include an adjuvant. Adjuvants such as 
incomplete Freund's adjuvant, aluminum phosphate, aluminum hydroxide, or alum are 
materials well known in the art. And, as mentioned above, CTL responses can be primed 

20 by conjugating peptides of the invention to lipids, such as P 3 CSS. Upon immunization 

with a peptide composition as described herein, via injection, aerosol, oral, transdermal or 
other route, the immune system of the host responds to the vaccine by producing large 
amounts of CTLs specific for the desired antigen, and the host becomes at least partially 
immune to later infection, or resistant to developing chronic infection. 

25 Vaccine compositions containing the DR peptides of the invention are 

administered to a patient susceptible to or otherwise at risk of disease, such as viral 
infection or cancer to elicit an immune response against the antigen and thus enhance the 
patient's own immune response capabilities, for instance with CTL epitopes described in 
**. Such an amount is defined to be an "immunogenically effective dose. " In this use, the 

30 precise amounts again depend on the patient's state of health and weight, the mode of 

administration, the nature of the formulation, etc., but generally range from about 1.0 jig 
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to about 5000 ^g per 70 kilogram patient, more commonly from about 10 /ig to about 500 
fig per 70 kg of body weight. 

In some instances it may be desirable to combine the peptide vaccines of the 
invention with vaccines which induce neutralizing antibody responses to the virus of 
5 interest, particularly to viral envelope antigens. For instance, PADRE peptides can be 
combined with hepatitis vaccines to increase potency or broaden population coverage. 
Suitable hepatitis vaccines that can be used in this manner include, Recombivax HB® 
(Merck) and Engerix-B (Smith-Kline). 

For therapeutic or immunization purposes, the peptides of the invention can 

10 also be expressed by attenuated viral hosts, such as vaccinia or fowlpox. This approach 
involves the use of vaccinia virus as a vector to express nucleotide sequences that encode 
the peptides of the invention. Upon introduction into an acutely or chronically infected 
host or into a non-infected host, the recombinant vaccinia virus expresses the immunogenic 
peptide, and thereby elicits a host CTL response. Vaccinia vectors and methods useful in 

15 immunization protocols are described in, e.g., U.S. Patent No. 4,722,848, incorporated 
herein by reference. Another vector is BCG (Bacille Calmette Guerin). BCG vectors are 
described in Stover et al. f Nature 351, 456-460 (1991)) which is incorporated herein by 
reference. A wide variety of other vectors useful for therapeutic administration or 
immunization of the peptides of the invention, e.g., Salmonella typhi vectors and the like, 

20 will be apparent to those skilled in the art from the description herein. 

Antigenic conjugates may be used to elicit CTL ex vivo, as well. The 
resulting CTL can be used to treat chronic infections (viral or bacterial) or tumors in 
patients that do not respond to other conventional forms of therapy, or will not respond to 
a peptide vaccine approach of therapy. Ex vivo CTL responses to a particular pathogen 

25 (infectious agent or tumor antigen) are induced by incubating in tissue culture the patient's 
CTL precursor cells (CTLp) together with a source of antigen-presenting cells (APC) and 
the appropriate immunogenic peptide. After an appropriate incubation time (typically 1-4 
weeks), in which the CTLp are activated and mature and expand into effector CTL, the 
cells are inftised back into the patient, where they will destroy their specific target cell (an 

30 infected cell or a tumor cell). 

The peptides of this invention may also be used to make monoclonal 
antibodies. Such antibodies may be useful as potential diagnostic or therapeutic agents. 
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The peptides may also find use as diagnostic reagents. For example, a 
peptide of the invention may be used to determine the susceptibility of a particular 
individual to a treatment regimen which employs the peptide or related peptides, and thus 
may be helpful in modifying an existing treatment protocol or in determining a prognosis 
for an affected individual. In addition, the peptides may also be used to predict which 
individuals will be at substantial risk for developing chronic infection. 

Examples 

Materials and Methods 

Cells. The following Epstein-Barr virus (EBV) transformed homozygous cell lines 
were used as sources of human HLA Class II molecules: LG2 [DRBlcOlOl (DR1)1; 
GM3107 [DRB50101 (DR2w2a)]; MAT (DRB10301 (DR3)1; PREISS [DRB10401 
(DR4w4)l; BIN40 [DRB10404 (DR4wl4)l; SWEIG [DRB11101 (DRSwll)]; PITOUT 
[DRB10701 (DR7)] (a); KT3 [DRB10405 (DR4wl5)]; Herluf [DRB 11201 (DR5wl2)]; 
HO301 [DRB11302 (DR6wl9)]; OLL [DRB10802 (DR8w2)]; and HTC9074 [DRB10901 
(DR9), supplied as a kind gift by Dr. Paul Harris, Columbia University]. In some 
instances, transfected fibroblasts were used: L466.1 [DRB11501 (DR2w2b)]; TR81.19 
[DRB30101 (DR52a)]; and L257.6 [DRB40101 (DRw53)]. (Valli, et al 7. Clin. Invest. 
91:616 (1993). Cells were maintained in vitro by culture in RPMI 1640 medium 
supplemented with 2mM L-glutamine [GIBCO, Grand Island, NY], 50fxM 2-ME, and 
10% heat-inactivated FCS [Irvine Scientific, Santa Ana, CA]. Cells were also 
supplemented with 100 /ig/ml of streptomycin and lOOU/ml of penicillin [Irvine 
Scientific], Large quantities of cells were grown in spinner cultures. 

Cells were lysed at a concentration of 10 8 cells/ml in PBS containing 1 % 
NP-40 [Fluka Biochemika, Buchs, Switzerland], ImM PMSF [CalBioChem, La Jolla, 
CA], 5mM Na-ortho vanadate, and 25mM iodoacetamide [Sigma Chemical, St. Louis, 
Mo], The lysates were cleared of debris and nuclei by centrifugation at 10,000 x g for 20 
min. 

Affinity purification of HLA-DR molecules. Class II molecules were purified by 
affinity chromatography as previously described (Sette, et al /. Immunol 142:35 (1989) 
and Gorga, et al 7. Biol Chem. 262:16087 (1987)) using the mAb LB3.1 coupled to 
Sepharose 4B beads. Lysates were filtered through 0.8 and 0.4 yM filters and then passed 
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over the anti-DR column, which were then washed with 15-column volumes of lOmM 
TRIS in 1% NP-40, PBS and 2-column volumes of PBS containing 0.4% 
n-octylglucoside. Finally, the DR was eluted with 50mM diethylamine in 0.15M NaCl 
containing 0.4% n-octylglucoside, pH 11.5. A 1/25 volume of 2.0M Tris, pH 6.8, was 
5 added to the eluate to reduce the pH to -8.0, and then concentrated by centrifugation in 
Centriprep 30 concentrators at 2000 rpm (Amicon, Beverly, MA). 

Class II peptide-binding assays. A panel of 13 different specific DR-peptide assays were 
utilized in the present study. These assays were chosen as to be representative of the most 

10 common DR alleles. Table I lists for each DR antigen, the representative allelic product 
utilized, the cell line utilized as a source of DR, and the radiolabled probe utilized in the 
assay. Purified human Class II molecules [5 to 500 nM] were incubated with various 
unlabeled peptide inhibitors and 1-10 nM 125 I-radiolabeled probe peptides for 48h in PBS 
containing 5% DMSO in the presence of a protease inhibitor cocktail. The radiolabeled 

15 probes used were HA Y307-319 (DR1), Tetanus ToxoidfTT] 830-843 (DR2w2a, 

DRSwlll, DR7, DR8w2, DR8w3, DR9), MBP Y85-100 (DR2w2b), TT1272-1284 
(DR52a), MT 65 kD Y3-13 with Y7 substituted with F for DR3, a non-natural peptide 
with the sequence YARFQSQTTLKQKT (DR4w4, DR4wl5, DRw53) (Valli, et al 
supra), and for DR5wl2, a naturally processed peptide eluted from the cell line C1R, 

20 EALIHQLINPYVLS (DR5wl2) and 650.22 peptide, (TT 830-843 A - S836 analog), for 
DR6wl9. 

Radiolabeled peptides were iodinated using the chloramine-T method. 
Peptide inhibitors were typically tested at concentrations ranging from 1201 /-eg/ml to 1.2 
ng/ml. The data were then plotted and the dose yielding 50% inhibition (IC50) was 

25 measured. In appropriate stoichiometric conditions, the IC50 of an unlabeled test peptide 
to the purified DR is a reasonable approximation of the affinity of interaction (Kd). 
Peptides were tested in two to four completely independent experiments. The final 
concentrations of protease inhibitors were: ImM PMSF, 1.3nM 1.10 phenanthroline, 73 
fM pepstatin A, 8mM EDTA, and 200 N alpha-p-tosyl-L-lysine chloromethyl ketone 

30 (TLCK) [All protease inhibitors from CalBioChem, La Jolla, CA]. Final detergent 

concentration in the incubation mixture was 0.05% Nonidet P-40. Assays were performed 
at pH 7.0 with the exception of DR3, which was performed at pH 4.5, and DRw53, which 
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was performed at pH 5.0. The pH was adjusted as previously described (Sette, et al J. 
Immunol 148:844 (1992)). 

Class II peptide complexes were separated from free peptide by gel 
filtration on TSK2000 columns (TosoHaas 16215, Montgomery ville, PA), and the fraction 
5 of bound peptide calculated as previously described (Sette, et al , (1989) supra). In 
preliminary experiments, the DR prep was titered in the presence of fixed amounts of 
radiolabeled peptides to determine the concentration of Class II molecules necessary to 
bind 10-20% of the total radioactivity. All subsequent inhibition and direct binding assays 
were the performed using these Class II concentrations. 

10 

DRB1 specificity of DR4wl5, DR6wl9, DR8w2, DR8w3, and DR9 assays. 

Because the antibody used for purification is a-chain specific, pi molecules 
are not separated from P3 (and/or p4 and P5) molecules. Development and validation of 
assays in regard with DRp chain specificity has been described in detail elsewhere for 

15 many of the DR alleles listed above (108). Herein we describe for the first time DR4wl5, 
DR6wl9, DR8w2, DR8w3, and DR9 assays. Experiments addressing the P chain 
specificity of these new assays are described in the present section. 

DR4wl5. The P4 product DRw53 is co-expressed with DR4wl5 and the 
determination of the specificity of the DR4wl5 binding assay is complicated in that the 

20 same radiolabeled ligand is used for both the DR4wl5 and DRw53 binding assays. Since 
typically pi chains are expressed at 5-10 fold higher levels than other P chains, and all 
binding assays are performed utilizing limiting DR amounts, it would be predicted that the 
dominant specificity detected in the assay would be DR4wl5. To verify that this was 
indeed the case, the binding pattern of a panel of 58 different synthetic peptides in the 

25 putative DR4wl5 specific assay with that obtained in a DRw53 specific assay (which uses 
a DRw53 fibroblast as the source of Class II molecules). Two very distinct binding 
patterns were noted, and in several instances, a peptide bound to one DR molecule with 
high affinity, and did not bind to the other (data not shown). 

DR6wl9. The DR6wl9 assay utilizes as the source of Class II molecules 

30 the EBV transformed homozygous cell line H0301, which co-expresses DRB30301 

(DR52a). While the radiolabeled ligand used in the DR6wl9 assay is different than that 
used for the DR52a assay, the ligand is related (i.e., is a single substitution analog) to a 
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high affinity DR52a binder. As was done in the case of DR4wl5, the specificity of the 
assay was investigated by analyzing the binding capacity of a panel of naturally occurring 
peptides for DR6wl9 and DR52a. The two assays demonstrated completely different 
binding specificities. For example, in terms of relative binding, TT 1272-1284 binds 
5 63-fold better in the DR52a assay than in the DR6wl9 assay. Conversely, the Invariant 
chain peptide binds 189-fold better in the DR6wl9 assay. In conclusion, these data 
demonstrated that the binding of the radiolabeled peptide 650.22 to purified Class II MHC 
from the H0301 cell line is specific for DR6wl9. 

DR8w2 and DR8w3. The pi specificity of the DR8w2 and DR8w3 assays 
10 is obvious in that no p3 (and/or B4 and p5) molecule is expressed. 

DR9. The specificity of DR9 assay is inferred from previous studies which 
have shown that the TT 830-843 radiolabeled probe peptide does not bind to DRw53 
molecules (Alexander, etaL, Immunity 1:751 (1994)). 

15 Results 

DR binding affinity of antigenic peptides recognized by DR restricted T cells 

To define a threshold DR binding affinity, to be considered as biologically 
significant, we compiled the affinities of a panel of 32 reported instances of DR restriction 
of a given T cell epitope. In approximately half of the cases, DR restriction was 

20 associated with affinities of less than 100 nM, and in the other half of the instances, with 
IC50% in the 100-1000 nM range. Only in 1 out of 32 cases (3. 1 %) DR restriction was 
associated with IC50% of 1000 nM or greater. It was noted that this distribution of 
affinities differs from what was previously reported for HLA class I epitopes, where a vast 
majority of epitopes bound with IC50% of 50 nM or less (Sette, el al , JI, 1994). This 

25 relatively lower affinity of class II restricted epitope interactions might explain why 

activation of class II restricted T cells in general requires more antigen relative to class I 
restricted T cells. 

In conclusion, this analysis suggested that 1000 nM may be defined as an 
affinity threshold associated with immunogenicity in the context of DR molecules, and for 
30 this reason a suitable target for our studies. 



PI and P6 anchors are necessary but not sufficient for DRB10401 binding 
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Several independent studies have pointed to a crucial role in DRB 10401 
binding of a large aromatic or hydrophobic residue in position 1, near the N-terminus of 
the peptide and of a 9-residue core region (residues 1 through 9). In addition, an 
important role has been demonstrated for the residue in position six (P6) of this 9-residues 
5 core region. Short and/or hydrophobic residues were in general preferred in this position 
(O'Sullivan, etal. y JI 147:2663, 1991; Sette, etal y JI 151:3163, 1993; Hammer, etal, 
Cell 74:197, 1993 and Marshall, et ai, JI 154:5927, 1995). 

In the present set of experiments, a library of 384 peptides was analyzed for 
DRB 10401 binding capacity and screened for the presence of the P1-P6 motif (that is, F, 

10 W, Y, L, I, V or M in PI and S, T, C, A, P, V, I, L or M in P6, at least 9 residues apart 
from the peptide C-tenninus. This set of 384 peptides contained a total of 80 DR4w4 
binders (specifically 27 good binders [IC50 of 100 nM or less], and 53 intermediate 
binders [IC50 of the 100-1000 range]. Seventy-seven out of the 80 DR4w4 binders (96%) 
carried the P1-P6 motif. However, it should be noted that most non-DR4w4 binding 

15 peptides also contained the P1-P6 motif. Of 384 peptides included in our database, only 
125 were "P1-P6 negative." Only three of them (6%) bound appreciably to purified 
DR4w4 as opposed to 77/259 (30%) of the "P1-P6 positive' 1 peptides. Therefore, these 
results demonstrate that presence of suitable PI and P6 anchors are necessary but not 
sufficient for DRB10401 binding, 

20 

A detailed map of DRB10401 peptide interactions 

Next, for each P1-P6 aligned core region, in analogy with what the strategy 
previously utilized to detail peptide class I interactions the average binding affinity of 
peptides carrying a particular residue, relative to the remainder of the group, were 

25 calculated for each position. Following this method a table of average relative binding 
(ARB) values was compiled. This table also represents a map of the positive or negative 
effect of each of the 20 naturally occurring amino acids on DRB 10401 binding capacity 
when occupying a particular position, relative to the main P1-P6 anchors (Figure 1), 

Variations in ARB values greater than four fold (ARB 4 or <; 0.25) were 

30 arbitrarily considered significant and indicative of secondary effects of a given residue on 
DR-peptide interactions. Most secondary effects were associated with positions 4, 7, and 
9. These positions correspond to secondary anchors engaging shallow pockets on the DR 
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molecule. In addition, significant secondary effects were detected for M in position 3 
(ARB = 12.8) T in position 3 (ARB = 4.34) and I in position 5 (ARB = 4.4). 

Development of a DRB10401 specific algorithm 

5 Next, the ARB table was utilized to develop a DRB10401 specific 

algorithm, In order to predict 0401 binding propensity, each aligned P1-P6 sequence was 
scored by multiplying, for each position, the ARB value of the appropriate amino acid. 
According to this procedure, a numerical "algorithm score" was derived. If multiple 
P1-P6 alignments were possible, binding scores were calculated for each one and the best 

10 score was selected. The efficacy of this method in predicting 0401 binding capacity is 
shown in Table Ha. 

Considering only peptides with algorithm scores above -17.00 narrowed the 
set of predicted peptides to 156. This set still contained 72 out of 80 (90%) of the total 
high or intermediate DR binders. Raising the cut-off to an algorithm score of -16.44 or 

15 higher still allowed identification of 60 out of 80 (75%) of the DR4w4 binding peptides. 
Of the whole 107 peptide set, twenty-five of them were either good or intermediate 
binders. In other words, as expected, increasing the algorithm score stringency predicted 
a smaller fraction of the total binders present in the set, but at the same time less false 
positive peptides were identified. 

20 

Blind test of the predictive power of the DRB10401 specific algorithm 

To verify that the predictive capacity of our algorithm was not merely a 
reflection of havirig utilized the same data set to test and define the algorithm itself, we 
further examined its efficacy in a blind prediction test. For this scope we utilized data 

25 from an independent set of 50 peptides, whose binding affinities were known, but that had 
not been utilized in the derivation of the algorithm. As shown in Table lib, the algorithm 
was effective in predicting DR4w4 binding capacity of this independent peptide set. The 
algorithm score of -17.00 identified a total 18 peptides. This set contained 3/3 (100%) of 
all good binders, and 8/11 (70%) of all intermediate binders in the entire test set of 50 

30 peptides. Increasing the cut-off value to -16.44, identified a set of nine peptides. Seven 
of them (78%) were either good or intermediate binders. This set contained 7 out of 14 
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(50%) of the binders contained in the blind prediction peptide set. In conclusion, these 
data supports the validity of the DR4w4 specific algorithm described above. 

Detailed maps of DRB10401, DRB10101, and DRB10701 peptide binding specificities 
5 Next, we analyzed the binding to purified DR1 and DR7 molecules for the 

same set of 384 peptides utilized to define the DR4w4 algorithm. It was found that this set 
contained 120 and 59 binders for the DR1 and DR7 alleles, respectively. A total of 158 
peptides were capable of binding either DR1 , DR4w4 or DR7. A large fraction of them 
(73/158; 46%) were also degenerate binders, which bound two or more of the three alleles 
10 thus far considered. Furthermore, we also found that more than 90% of the DR1 or DR7 
good and intermediate binders carried the P1-P6 motif. Most importantly, 72 out of 73 
(99 %) degenerate DR binders carried this motif (data not shown) . In conclusion, this 
analysis suggests that P1-P6 based algorithms might be utilized to effectively predict 
degenerate DR binders. 

15 In analogy with what was described above for DR4w4 molecules, specific 

algorithms were designed for the DR1 and DR7 alleles. Figures 2A and 2B detail the 
allele specific maps defined according to this method. 

As in the case of DRB 10401, most secondary effects were concentrated in 
positions 4, 7 and 9. Position 4 was especially prominent in the case of DR1, while 

20 position 7 was the most prominent secondary anchor for DR7. Specific algorithms were 
developed based on these maps, and it was found that the cut-off values necessary to 
predict 75% or 90% of the binders were -19.32 and -20.28 for DR1, and 20.91 and 
-21.63 for DR7, respectively. Depending on the particular allele or cut off value selected, 
40 to 60% of the predicted peptides were in fact good or intermediate binders (data not 

25 shown). 

Development of a DR1-4-7 combined algorithm 

Finally, we examined whether a combined algorithm would allow to predict 
degenerate binders. For this purpose, the sequences of the 384 peptides in our database 
30 were simultaneously screened with the three (DR1, 4w4, and 7) specific algorithms. It 
was found that an even 100 peptides were predicted (using the 75% cut off) to bind either 
two or three of the alleles considered. This set contained 59 out of 73 (81 %) of the 
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peptides which were in fact capable of degenerate 1-4-7 binding (defined as the capacity to 
bind to more than one of the DR1, 4w4 or 7 alleles) (Table III). 

Definition of a target set of DR specificities, representative of the world population 

5 The data presented in the preceding sections illustrates how peptides capable 

of binding multiple DR alleles can be identified by the use of a combined "1-4-7" - 
algorithm. Next, we wished to examine whether the peptides exhibiting degenerate 1-4-7 
binding behavior would also bind other common DR types as well. As a first step in our 
experimental strategy, we sought to define a set of target DR types representative of a 

10 large 80%) fraction of the world population, irrespective of the ethnic population of 

origin. For this purpose, seven additional DR antigens were considered. For each one of 
the DR antigens considered in this study, (including DR1, 4 and 7), the estimated 
frequency in various ethnicities, according to the most recent HLA workshop (11th, 1991) 
is shown in Table IVa, together with the main subtypes thus far identified. 

15 For the purpose of measuring peptide binding affinity to the various DR 

molecules, one representative subtype for each DR antigen was chosen (Table I). It 
should be noted that for most antigens, either one subtype is by far the most abundant, or 
alternatively a significant degree of similarity in the binding pattern displayed by the 
different, most abundant subtypes of each DR antigen is likely to exist (see comments 

20 column of Table IVb) . One exception to this general trend is represented by the DR4 

antigen, for which significant differences in peptide specificity between the 0401 and 0405 
have been reported. Since both alleles are quite frequent (in Caucasians and Orientals, 
respectively) we included both DR 0401 and 0405 in the set of representative DR binding 
assays. 

25 Our set of representative assays is mostly focused on allelic products of the 

gene, because these molecules appear to be the most abundantly expressed, serve as the 
dominant restricting element of most human class III responses analyzed thus far, and 
accurate methods for serologic and DNA typing most readily available. However, we 
have also considered in our analysis assays representative of DRB3/4/5 molecules (Table 

30 IVc). These molecules serve as a functional restriction element, and their peptide binding 
specificity has been previously shown to have certain similarities to the specificity of 
several common DRp, allelic products. 
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A general strategy for prediction of DR-degenerate binders. 

To test whether the 1-4-7 combined algorithm would also predict degenerate 
binding to other common DR types, we measured the capacity of three different groups of 
synthetic peptides to bind the panel of purified HLA DR molecules. The three different 
5 peptide sets were: A) 36 peptides which did not score positive in the combined 1-4-7 
algorithm (non-predictions), B) 36 peptides which did score positive for the 1-4-7 
algorithm, at the 75 % cut off level, but had been found upon actual testing not to be 
degenerate 1-4-7 binders ("wrong" predictions), and C) 29 peptides which scored positive 
in the 1-4-7 algorithm, and also proved upon experimental testing, to be actual 1-4-7 
10 degenerate binders (correct predictions). The results of this analysis are shown in Table 
V. 

Within the set of "non-predictions" peptides (Table Va) only 3 out of 34 
(9%) bound at least two of the DR1, 4w4 or 7 molecules. Interestingly, 2 (1136.04 and 
1136.29) out of 3 of these peptides were also rather crossreactive, and bound additional 

15 DR types (DR2w2 P2, DR4wl5, 5wll and 8w2 in the case of 1136.04, and 2w2 P2, 

4wl5, 9 and 5wl2 in the case of 1136.29). Peptides from the "wrong predictions" peptide 
set (Table V5), by definition bound at the most only one of the DR1, 4w4 or DR7 
molecules, and were also poorly degenerate or other DR types with 
only two peptides (1136.22 and 1188.35) binding a total of three DR molecules. Within 

20 this peptide set, no peptide bound four or more of the DR molecules tested (data not 
shown). 

These results are contrasted by data obtained with the peptide set 
corresponding to peptides which were first predicted by the use of the combined 1, 4, 7 
algorithm, and then experimentally found to be degenerate DR 1-4-7 binding. Fourteen out 
25 of 29 peptides tested (48%) bound a total of five or more alleles. Four of them were 
remarkably degenerate (1188.16, 1188.32, 1188.34 and F107.09) and bound a total of 
nine out of the 1 1 DR molecules tested. In conclusion, these results suggest that a strategy 
based on the sequential use of a combined DR1, 4, 7 algorithm and quantitative DR1, 4, 7 
binding assays can be utilized to identify broadly crossreactive DR binding peptides. 



30 
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Definition of the HLA-DR 1-4-7 supertype 

The data presented above also suggested that several common DR types are 
characterized by largely overlapping peptide binding repertoires. When this issue was 
analyzed in more detail, by analyzing the binding pattern of the thirty-two peptides from 
5 Table Va and b which were actual DR1-4-7 degenerate binders. Thirty-one of them (97%) 
bound DR1, 22 (69%) DR4w4 and 21 (66%) DR7. These files are contrasted with the 
low percentages of binding observed amongst the remainder non-degenerate binding 
peptides (17/67 (25%), 8/67 (12%) and 7/67 (10%), for DR1, 4w4 and 7, respectively) 
(Table VII). 

10 Interestingly, a large fraction of the 1-4-7 degenerate binders also bound 

certain other common DR types. Sixteen (50%) bound DR2w2a, 18 (56%) DR6wl9, 18 
(56%) DR2w2b and 20 (62%) DR9. In all cases, the frequency of binding in the non-1-4- 
7 degenerate peptide set was much lower (Table VIII). 

Significant, albeit lower, frequencies of cross reactivity were noted also for 

15 DR4wl5, DR5wll, and DR8w2 (in the 28 to 37% range). Finally, negligible levels of 
cross reactivity were observed in the case of DR3 and 5wl2 and DR53. Further studies 
will address whether either of these two group of molecules (DR4wl5, 5wll, and 8w2 on 
one hand, and DR3, DR53 and 5wl2 on the other) might belong to different DR 
supertypes. 

20 In conclusion, these data demonstrates that a large set of DR molecules 

encompassing DR1, 4w4, 2w2a, 2w2b, 7, 9 and 6wl9 is characterized by largely 
overlapping peptide binding repertoires. 

Discussion 

25 In the present report we have analyzed the peptide binding specificity of a 

set of 13 different DR molecules, representative of DR types common among the 
worldwide population. Detailed maps of secondary anchors and secondary interactions 
have been derived for three of them (DR4w4, DR1 and DR7). Furthermore, we 
demonstrated that a set of at least seven different DR types share overlapping peptide 

30 binding repertoires; and consequently that broadly degenerate HLA DR binding peptides 
are a relatively common occurrence. This study also describes computerized procedures 
which should greatly assist in the task of identification of such degenerate peptides. 
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We would like to discuss the data in the context of our current 
understanding of peptide-class II interactions, as well as in the context of the recently 
described class I supermotifs. Finally, the potential implications of broadly degenerate 
class II epitopes for epitope based vaccine design should also be considered. 
5 Firstly, our studies illustrate how the vast majority of the peptides binding 

with good affinity to DR4w4, DR1, DR7 and most of the other DR types analyzed in the 
current study (data not shown), are all characterized by a P1-P6 motif consistent with the 
one originally proposed by O' Sullivan, et ah Crystallographic analysis of DRl-peptide 
complexes revealed that the residues occupying these positions engage two complementary 

10 pockets on the DR1 molecule, with the PI position corresponding to the most crucial 
anchor residue and the deepest hydrophobic pocket. Our analysis also illustrates how 
other "secondary anchor" positions drastically influence in an allele-specific manner 
peptide binding capacity. Position 4 was found to be particularly crucial for DR1 binding, 
position 9 for DR4w4, and position 7 for DR7. These data are consistent with previous 

15 results which originally described such allele-specific anchors, and with crystallographic 
data which illustrates how these residues engage shallow pockets on the DR molecule. 

Secondly, our studies illustrate how an approach based on alignment and 
calculation of average relative binding values of large peptide libraries allows definition of 
quantitative algorithms to predict binding capacity. The present study extends those 

20 observations to two other common HLA-DR types, and also illustrates how the combined 
use of the 1-4-7 algorithms can be of aid in identifying broadly degenerate DR binding 
peptides. 

The data presented herein suggest that a group of common DR alleles, 
including at least DR1, DR2w2a, DR2w2b, DR4w4, DR6wl9, DR7 and DR9 share a 

25 largely overlapping peptide repertoire. Degenerate peptide binding to multiple DR alleles, 
and recognition of the same epitope in the context of multiple DR types was originally 
described by Lanzavechia, Sinigallia's and Rothbard's groups. The present study provides 
a classification of alleles belonging to a main HLA-DR supertype (DRl-4-7-like) which 
includes DR1, DR2w2a, DR2w2b, DR4w4, DR7, DR9, DR6wl9. On the basis of the 

30 data presented herein, at least two additional groups of alleles exist. The first group 

encodes for molecules with significant, albeit much reduced overlap with the 1-4-7-like 
supertype (DR4wl5, 8w2, 5wll). The second group of alleles (5wl2, 3wl7, and w53) 
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clearly has little repertoire association with the 1-4-7 supertype. In this context it is 
interesting to note that Hammer, et al noted that good DR5wl 1 binding peptides are 
frequently characterized by positively charged P6 anchor (which would be poorly 
compatible) with the herein proposed 1-4-7 supermotif. It is also interesting to note that 
5 Sidney, et al. proposed that DR3wl7 binds a set of peptides largely distinct from those 
bound by other common DR types. Future studies will have to determine whether any of 
the molecules listed above can be grouped in additional DR supertypes. Our group is 
currently investigating whether analysis of polymorphic residues lining the peptide binding 
pockets of DR can be utilized to aid in the classification and prediction of HLA DR 
10 supertypes. 

We would like to comment on similarities and differences between the HLA 
DR supertype described herein and the recently described HLA class I supermotifs. Class 
I supermotifs are clear-cut and, as a rule, non-overlapping. Four of them have been 
described all approximately equally frequent amongst the worldwide population. By 

15 contrast, the repertoire defining the HLA DR supertype herein described is not clear-cut 
and overlaps, at least in part, with the repertoire of other alleles. It also appears that on 
the basis of the data presented in Tables I and IV, even if other DR supertypes exist, the 
DR1-4-7 is going to be by far the most abundantly represented worldwide. 

Finally, we would like to point out the possible relevance of these data in 

20 terms of development of epitope based vaccines. Class II restricted HTL have been 

implicated in protection from, and termination of many important diseases. Inclusion of 
well defined class II epitopes in prophylactic or therapeutic vaccines may allow to focus 
the immune response towards conserved or subdominant epitopes, and avoid suppressive 
determinants. Based on the data presented herein (Table IV), the DR1-4-7 supertype 

25 would allow coverage in the 50 to 80% range, depending on the ethnicities considered. It 
is thus possible that broad and not ethnically biased population coverage could be achieved 
by considering a very limited number of peptide binding specificities. 

Based on the results present above, the sequences of various antigens of 
interest were scanned for the presence of the DR 1-4-7 motifs. Peptides identified using 

30 this approach are broadly cross reactive, class II restricted T cell epitopes. Table VIII 
presents a listing of such peptides derived from HBV, HCV, HIV and Plasmodium 
falciparum (Pf). A total of 146 peptides were identified: 35 from DHBV, 16 from HCV, 
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27 

50 from HIV, and 45 from Pf. Standard conservancy criteria were employed in applicable 
cases. 

The above examples are provided to illustrate the invention but not to limit 
its scope. Other variants of the invention will be readily apparent to one of ordinary skill 
in the art. All publications, patents, and patent applications cited herein are hereby 
incorporated by reference for all purposes. 
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Table II 



An algorithm to predict DRBl* 0401 binding capacity, 
a) Original peptide set 

No. of peptides (Binding nM) 



Selection 
Criteria 


High 
<a00 


Inter. 
100-1000 


Non 
>1000 


Total 


None 


27 


53 


304 


364 


P1-P6 


27 


50 


182 


259 


-17.00 " 


27 


45 


84 


156 


.16.44 a 


25 


35 


47 


107 



1) 

2) 



Algorithm score which predicts 90% of all binders. 
Algorithm score which predicts 75% of all binders. 
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Table II 

b) Blind test of the predictive power of the DRB1*0401 algorithm. 

No. of peptides (Binding nM) 

Selection High Inter. Non 

Criteria £100 100-1000 >1*000 Total 



None 


3 


11 


36 


50 


P1-P6 


3 


9 


28 


40 


-17.00 


3 


8 


7 


18 


-16.44 


3 


4 


2 


9 
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Table III 
A combined "1-4-7" algorithm. 



Selection 
Criteria 



Degenerate 
Binders " 



Percent of Total 
Degenerate Binders 



None 
P1-P6 



Combined Algorithms 
(90% Cutoff Value) 



Combined Algorithms 
(75% Cutoff Value) 



73/384 
72/259 

67/147 
59/100 



100% 
99% 

92% 
81% 



1) Degenerate binders are defined as peptides binding at least two out of the three 
DR1, 4w4, and 7 molecules with an IC50 of 1 nM or less. 
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Table IV 

Phenotypic frequencies of 10 prevalent HLA-DR antigens 



Phenotypic Frequencies 



Antigen 


Alleles 


Cauc. 


Blk. 


Jpn. 


Chn. 


Hisp. 


Avg. 


DR1 


DRB1MH01-03 


18.5 


8.4 


10.7 


45 


10.1 


10.4 


DR2 


DRB1»1501-03 


19.9 


14.8 


30.9 


22.0 


15.0 


20.5 


DR3 




17.7 


19.5 


0.4 


7.3 


14.4 


11.9 


DR4 


DRBl»040t-12 


23.6 


6.1 


40.4 


21.9 


29.8 


24.4 


DR7 


DRBr0701-O2 


26.2 


11.1 


1.0 


15.0 


16.6 


14.0 


DR8 


DRBl # 0801-5 


5.5 


10.9 


25.0 


10.7 


23.3 


15.1 


DR9 


DRB1 '09011,09012 


3.6 


4.7 


245 


19.9 


6.7 


11.9 


DR11 


DRB1M 101-05 


17.0 


18.0 


4.9 


19.4 


18.1 


15.5 


DR12 


DRBl'1201-02 


2.8 


5.5 


13.1 


17.6 


5.7 


8.9 


DR13 


DRB1M301-06 


21.7 


16.5 


14.6 


12.2 


105 


15.1 
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Table VII 



1-4-7 Non 1-4-7 

Degenerate Degenerate 
Binders (%) Binders (%) 



! 31/32 (97) 17/67 (25) 

4w4 22/32 (69) 8/67 (12) 

7 21/32 (66) 7/67 (10) 

~ 20/32 (62) " 2/67 (3.0) 

6wl9 18/32 (56) 6/67 (8.9) 

2w 2Bb W/32 (56) 16/67 (24) 

2w2fia 16/32 (50) 10/67 (15) 

4W 15 12/32 (37) ~" 4/67 (6.0) 

8w2 10/32 (31) 3/67 (4.5) 

5W H 9/32 (28) 6/67 (8.9) 

5wl2 3/32 (9.4) ~ 4/67 (6.0) 

3w l7 1/32 (3.1) 0/67 (0) 

w53 2/16 (13) 7/43 (16) 
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Table Vin 



Sequence 


Source 


1st 
Pot 


Conservancy 


Predicted 
1-4-7 


IGPFMKA VCVEVEKT 


Pf TRAP 


237 


inn 
1UU 


3 


ILSVFFLALFFUFN 


P/ EXP1 


3 




. 3 


KSKYKLAT5VLAGLL 


Pf EXPl 


71 




3 


KYKLATSVLACLLCN 


Pf EXP1 


T\ 

t o 




3 


LGNVKYLVTVFLIFF 


Pf TRAP 


A 
*t 


1 fYl 


3 


LSVFFLALFHIFNK 


Pf EXP1 


4 




3 


LVNLUFHINGKIIK ~ 


Pf LSA1 


11 

1*3 




3 


MKILSVFFLALFHl 


Pf EXP1 


1 
1 




3 


MRKLAILSVSSFLFV 


Pf CSP 


2 


05 


3 


NSSIGLIMVLSFLFL 


Pf CSP 


417 




3 


NVK YLVrVFLHTOL 


Pf TRAP 




inn 


3 


SFYFIL VN LUFHIN 


Pf LSA1 






3 


V hhLALFFIIFNKES 


Pf EXP1 


6 




3 


YFILVNLUFH1NGK 


Pf LSA1 


10 




3 . 


YISFYFILVNLUFH 


Pf LSA1 


6 






AGLLCNVSTVLLGCV 


Pf EXP1 


82 






ANQLWILTDGIPDS 


Pf TRAP 


153 


100 


2 


A YKFVVPG A ATPYAC 


Pf TRAP 


514 


80 


2 


DKELTMSNVKNVSQT 


Pf LSA1 


81 




2 


FN WN55ICUM VLS 


Pf CSP 


413 


100 


2 


FYFILVNLUFHING 


Pf LSA1 


9 




2 


CLAYKFWPCAATPY 


Pf TRAP 


512 


80 


2 


GRDVQNN[VDEIKYR 


Pf TRAP 


25 


90 


2 


HILY15FYFILVNLL 


Pf LSA1 


3 




2 


HNWVNHAVPLAMKU 


Pf TRAP 


62 


80 


2 


IVFUFFDLFLVNGR 


Pf TRAP 


12 


100 


2 


KFWPCAATPYAGEP 


Pf TRAP 


516 


60 


2 


KSIXRNLGVSENrR 


Pf LSA1 


98 




2 


KYLVIVFLIFFDLFL 


Pf TRAP 


8 


TOO 


2 


LAGLLGNV5TVLLGC 


Pf EXPl 


81 




2 


LCNVSTVLLCCVCLV 


Pf EXP1 


85 




2 


UFFDLFLVNGRDVQ 


Pf TRAP 


15 


100 


2 


LWILTDQPDSIQD 


Pf TRAP 


156 


100 


2 


QLWTLTDGIPDSIQ 


Pf TRAP 


155 


100 


2 


RGYYIPHQ55LPQDN 


Pf LSA1 


1669 




2 


RHN W VNHAVPLAMKL 


Pf TRAP 


61 


80 


2 


RHPFKIGSSDPADNA 


Pf EXPl 


107 




2 


S5VFNWN5SIGUM 


Pf CSP 


410 


95 


2 


VFNWNS5IGUMVL 


Pf CSP 


412 


95 


2 


VKNV1GPFMKAVCVE 


Pf TRAP 


223 


100 


2 


VKYLVrVFLEFFDLF 


Pf TRAP 


7 


100 


2 


VSTVLLCCVGLVLYN 


Pf EXP1 


88 




2 


WENVKNVIGPFMKAV 


Pf TRAP 


220 


100 


2 


YKFWPCAATPYACE 


Pf TRAP 


515 


80 


2 
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Table VIII 



Sequence S 


Source 


1st 
Po. 


Conservancy 


Predicted 
1-4-7 


ENRWQVMIVWQVDRM I 


-AVI ViF 


2 


81 


3 


ERYLKDQQLLOWGCS I 


ENV 


589 




3 


ESELVSQUEQLIKK I 


4IV1 POL 


696 


80 


3 


FRKYTAFITPSINNE I 


-nvi POL 


303 


93 


3 


GQMVHQAISPRTLNA ] 


-nvi GAG 


172 


88 


3 


IPEWEFVNTPPLVKL 1 


4IV1 POL 


593 


93 


3 


LPPWAKETVASCDK 1 


-nvi pol 


770 


87 


3 


NREILKEPVHCVYYD 1 


rflVl POL 


4S5 


87 


3 


PAIFQS5MTKILEPF 1 


MVl POL 


336 


80 


3 


PPVVAKEP/ASCDKC ] 


rflVl POL 


771 


87 


3 


QEQIGWMTN NPPIPV ] 


HIV1 GAG 


276 


81 


3 


QGQMVHQAI5PRTLN 


HIVl GAG 


171 


85 


3 


SPAIFQSSMTKILEP 


HIV1 POL 


335 


80 


3 


TLNFP1SPIETVPVK 


HIVl POL 


176 


100 


3 


VKNW MTETLLVQN AN 


HIV1 GAG 


343 


81 


3 


VPVWKEATTTLFCAS 


HIVl ENV 


54 


81 


3 


WEFVNTPPLVKLWYQ 


HIVl POL 


596 


93 


3 


WVKWEEKAFSPEVI 


HIV GAG 


187 


33 


3 


YYGVPVWKEATTTLF 


HIVl ENV 


51 


83 


3 


ASDFNLPPWAKETV 


HTVl POL 


765 


80 


2 


ASCYIEAEVIPAETC 


HTVl POL 


822 


93 


2 


DFNLPPVVAKEIVAS 


HTVl POL 


767 


87 


2 


EAHRILQQLLFIHF 


HIVl VPR 


58 


82 


2 


EKVYLAVWPAHKQG 


HIVl POL 


711 


93 


2 


ETAYFLLKLACRVVPV 


HIV POL 


838 


65 


2 


EVQLCIPHPACLKKK 


HIVl POL 


268 


80 


2 


FWEVQLGIPHPAGLK 


HTVl POL 


266 


100 


2 


GCTLNFPISPIEIVP 


HIVl POL 


174 


100 


2 


CHYKRWIILCLNKI 


HIVl GAG 


294 


85 


2 


CTVLVCPT"PVNnCR 


HIVl POL 


153 


100 


2 


HKAICTVLVCPTPVN 


HIVl POL 


149 


93 


2 


IGTVLVGFTPVNIIG 


HIV POL 


152 


74 


2 


KR WLILGLN KIVRMY 


HTVl GAG 


298 


68 


2 


KVYLAVWPAHKG1GG 


HIV POL 


712 


74 


2 


UCTTAVPWNASWSNK 


HIVl ENV 


607 




2 


LLQLTVWGIKQLQAR 


HIVl ENV 


731 


80 


2 


NFFISPICTVPVKLK 


HIVl POL 


178 


100 


2 


PQGWKGSPA1FQSSM 


HIVl POL 


329 


87 


2 


PVNOGRNLLTQIGC 


HIVl POL 


161 


87 


2 


QHLLQLTVWGIKQLQ 


HIVl ENV 


729 


80 


2 


QQHLLQLTVWCIKQL 


HIVl ENV 


728 


80 


2 


SPEVIPMFSALSEGA 


HTVl GAG 


197 


88 


2 


TKELQKQITKIQNFR 


HIV POL 


952 


67 


2 


TVLVGPTPVNI1CRN 


HTVl POL 


154 


100 


. 2 


VEAimiLQQlXHH 


HIVl VPR 


57 


82 


2 


VIPMF5ALSECATPQ 


HIVl GAG 


200 


88 


2 


VNIICRNLLTQICCT 


HTVl POL 


162 


87 


2 


WGCSGKUOTAVPWN 


HIVl ENV 


601 




2 


WI1 LGLN KTV RM Y5P 


HIVl GAG 


300 


88 


2 


YKRWULGLNKIVRM 


HIVl GAG 


297 


88 


2 


FILVNLUFHINGKI 


Pf LSAl 


11 




3 



Page 2 of 3 



WO 98/32456 



39 



PCT/US98/01373 



Table VIII 



Sequence 


Source 


1st 

POB 


Conservancy 


Predicted 
1-4-7 


AEDLNLCNLNVSIPW ~ 


HBV POL 


00 


DC 

95 


3 


DLNLCNLNVSIPVVTH 


HBV POI 


in 




3 


GhhU-llULTIPQSL 


HBV PMV 
**** v v 


1R1 
lot 


on 


3 


EFLHIXLCUFLLV 


HBV FMV 


OAS. 


on 

BO 


3 


NLNVSIFWTHKVGNF 


HBV POL 


45 


Sf5 


3 


PFLLA QFT5 AJCSW 


HBV POI 




95 


3 


RF5WLSLLWFVQWF 


HBV ENV 




1UU 


3 


SPFLLAQFTSAICSV 


HBV POL 


522 


oc 

70 


3 


SVRF5WLSLLVPFVQ 


HBV ENV 




oil 


3 


AP5YMDDWLGAKSV 


HBV POL 


546 


on 


2 


AGFFLLTRJLTIPQS 


HBV ENV 


180 


on 
OAJ 


2 


FVQWFVGL5PTVWLS 


HBV ENV 


342 


yo 


0% 

2 


CAHLSLRCLPVCAFS 


HBV X 


50 


on 


*> 
2 


GT5FVYVPSALNPAD 


HBV POL 


774 


ou 


2 


GVWIRTPPAYRPPNA 


HBV NUC 


123 




2 


HLSLRGLPVCAFSSA 


HBV X 




on 


2 


IlhU-lLLLCURL 


HBV ENV 




OU 


2 


ILLLCUFLLVLLDY 


HBV ENV 






2 


IVGLLGFA APFTQCG 


HBV POL 




on 


•j 

2 


KFAVPNLQSLTNLLS 


HBV POL 


406 




*> 


LAQFTSAICSWRRA 


HBV POL 


526 


95 




LCUFLLVLLDYQGM 


HBV ENV 


252 


95 


2 


LCQVFA D ATPTCWCL 


HBV POL 


694 


95 


2 


LHLYSHPIILGFRKI 


HBV POL 


501 


80 


2 


LL CUFLL VLLDYQG 


HBV ENV 


251 


95 


2 


LVLLDYQCMLPVCPL 


HBV ENV 


258 


90 


2 


LVPFVQWFVCLSFTV 


HBV ENV 


'339 


95 


2 


PLPIHTAELLAACFA 


HBV POL 


722 


80 


2 


QCCYPALMPLYACIQ 


HBV POL 


648 


95 


2 


RDLLDTASALYREAL 


HBV NUC 


28 


80 


2 


SFCVWIRTPPAYRPP 


HBV NUC 


121 


90 


2 


SWLSRKYTSFPWLL 


HBV POL 


j 750 


85 


2 


VCLLCFAAPFTQCGY 


HBV POL 


637 


95 


2 


VPN LQSLTN LLSSNL 


HBV POL 


409 


85 


2 


WPKFAVPNLQ5LTNL 


HBV POL 


404 


95 


2 


KVLVLNPSVAATLCF 


HCV 


1255 


100 


3 


PTLWARMILMTHFF5 


HCV 


2870 


79 


3 


ADLMGYIPLVGAPLG 


HCV 


131 


79 


2 


AVQWMNRUAFA5RC 


HCV 


1917 


100 


2 


DLEUTSCSSNVSVA 


HCV 


2812 


93 


2 


DLYLVTRHADVIPVR 


HCV 


1134 


79 


2 


EDLVNLLPAILSPGA 


HCV 


1882 


79 


2 


FTTLPALSTGUHLH 


HCV 


684 


79 


2 


GARLWLATATPPGS 


HCV 


1345 


79 


2 


GI Q YL AGLSTLPGNP 


HCV 


1776 


100 


2 


CVNYATCNLPCCSFS 


HCV 


161 


79 


2 


IQYLAGL5TLPGNPA 


HCV 


1777 


100 


2 


LH CLS AF5LHSYSPG 


HCV 


2919 


79 


2 


VNLLPAILSPCALW 


HCV 


1885 


79 


2 


VQ W MNKLI AFASRGN 


HCV 


1918 


100 


2 


YKVLVLNPSVAATLG 


HCV 


1254 


100 


2 
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Class II Peptides 

Peptide A A Sequence Source 



008.00 1 e SALLS5DITASVNCAK 
200.06 16 SALSEGATFODLMML 
213.10 16 NKALELFRKDIAAKYK 
506 01 ZO NKALELFRKDIAAKYKELGY 

506.03 1 8 ALELFRKDIAAKYKELGY 

506.05 1 G ELFRKDIAAKYKELGY 
S70 01 1 6 MAKTI AYDEE ARRGLE 

705.06 20 KVYLPRMKMEEKYNLTSVLM 

717.04 14 YASFVKTTTLRKFT-NH2 
857.02 ' 20 PHHTALRQAILCWGELMTLA 

865.01 1 S YKMKMVHAAHAKMKM 
F050.D3 20 GFYTTGAVRQIFGDYKTTIC 
F089.01 15 QNIULSNAPLGPQFP 
FO98.03 20 AAYAAOGYKVLVLNPSVAAT 
F09B.04 20 QYKVUVLNPSVAATLGFGAY 
F008.05 14 G Y KVLVLNPS VAAT 
F098.06 1 9 SYVNTNMGLKFRQLLWFHI 
F098.10 1 2 GLKFRQULWFH 

Fl 34.04 20 TLHGPTPLLYRLGAVQNElT 

FU4.05 20 NFISGIQYLAGLSTLPGNPA 

F 134.08 21 GEGAVQWMNRUAFASRGNHV 

IA.p5 17 KPVSQMRMATPULMRPM 

Tf-28 p1 24 LPKPPKPVSKMRMATPLLMOALPM 

27.0279 1 5 EYLVSFGVW1RTPPA 

27 0280 1 5 GVW1RTPPAYRPPNA 

27.0281 1 5 RHYLKTLWKAGILYK 

27.0283 1 5 VPNLOSLTNUSSNL 

27.0288 1 5 WVTVYYGVPWVKEAT 

27.0293 1 5 YYGVPVWKEATTTLF 

27.0294 1 5 VPWVKEATTTLFCAS 

27.0295 1 5 USGIVOQQNNORAJ 

27.0296 15 OOHULQLTWGIKOU 

27.0297 15 QHLLDLTVWG1KOLD 

27.0298 15 LLQLTVWGIKQIQAR 
27.0304 15 QGQM VHQAl SPRTLN 
27.0307 15 SPEVIPMFSAL5EGA 

27.03 1 0 15 QEaGWMTNNPPiPV 

27.0311 15 GHYKRWllLGLNK! 

27 .031 2 1 5 YKRWIILGLNKtVRM 

27.0313 1 5 KRWULGLNKIVRMY 

27.0314 1 5 WULGLNKIVRMYSP 

27.0315 1 5 VKNWMTETLLVQNAN 
27.0322 15 GTVLVGPTPVNHGR 
27.0324 15 PVNIIGRNLLTOIGC 
27 0326 15 GRNLLTQ1GCTLNFP 
27 032 B 15 TLNFPISPIETVPVK 
27 0329 1 5 NFPISPIETVPVKLK 
27^0341 1 5 FRKYTAFTIPSINNE 
27 0344 15 SPAIFOSSMTKILEP 
27.0345 1 6 PAIFOSSMTKILEPF 

27.0349 1 5 QKLVGKLNWASQIYA 

27.0350 1 5 VGKLNWASQJYAGIK 

27.0351 1 5 NRE1LKEPVHGVYYD 
27 0353 1 5 IPEWEFVMTPPLVKL 
27.0354 15 WEFVNTPPV.VKLWYQ 
27.0360 1 5 EOLIWEKVYIAVWP 



HEL 81-06 
HIV fl p25 41-56 
Sp, W. myo, 132-147 
SW Myp 132-151 
Sp. W myo. 134-151 
Sp. W myo. 136-151 
Heat Shock Prot 
Ova 279-298 

combinatorial; DR2 optimized 

HBV core 50-69 

OVA KM core extension 

PLP 91-110 

Tyrosinase 66-70 

HCV NS3 1242-1261 

HCV NS3 1 248-1 2B7 

HCV NS3 1248-1261 

HBV Gore 87-105 

HBV Core 94-105 

HCV NS4 1-20 

HCV NS4 151-170 

HCV NS4 293-313 (1914-1934) 

Mouse im/ariam chain 85*101 

Human invariant chain 80-103 

HBV NUC 117 

HBV NUC 123 

HBV POL 145 

HBV POL 409 

HIV1 ENV 47 

HIVi ENV 51 

HIV1 ENV 54 

HIV1 ENV 711 

HIV1 ENV 728 

HIVI ENV 729 

HIV1 ENV 731 

HIV1 GAG 171 

HIVi GAG 197 

HIVI GAG Z76 

HIV1 GAG 294 

HIVI GAG 297 

HtVl GAG 298 

HIVI GAG 300 

HIV1 GAG 348 

HIV1 POL 153 

HIV1 POL 161 

HIV1 POL 166 

HIV1 POL 176 

HIV1 POL 178 

HIVI POL 303 

HIV1 POL 335 

HIVI POL 336 

HIV1 POL 437 

HIV1 POL 440 

HIV1 POL 485 

HIV1 POL 593 

HIV1 POL 596 

HIVI POL 705 
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Class 11 Peptides 

Source 



27.0361 


15 


EKVYLAWVP AH KQIQ 


HIV1 POL 711 


27.0364 


15 


HSNWRAMASDFNLPP 


HIVi POL 758 


27.0370 


15 


ASGYIEAEVIPAETG 


HIV1 POL 822 


27.0372 


15 


AEHLKTAVQMAVFIH 


HIV1 POL 911 


27.0373 


15 


KT AVQMAVFI H NFKR 


HIV1 POL 915 


27.0377 


15 


QKQTTX1QNFRVYYR 


HIV1 POL 956 


27.0379 


15 


KLLWKGEGAWIQDN 


HIV1 POL 982 


27.0381 


15 


ENRWQVMIVWQVDRM 


HIV1 V1F 2 


27.0382 


1 5 


VEA) IRILQQUFIH 


HIV1 VPR 57 


27.0384 


15 


FNWNSSJGUMVUS 


Pf CSP 413 


27.0387 


15 


MNYYGKQENWYSIKK 


Pf CSP 53 


27.03BB 


15 


MRKLAILSVSSFLFV 


Pf C3P 2 


27.0390 


15 


NSSIGUMVLSFLFL 


Pf CSP 417 


27.0392 


15 


SSVFMWNSSIOUM 


Pf CSP 410 


27.0393 


15 


MK1LSVFFLALFFH 


Pf EXP1 1 


27.0398 


15 


FtLVNUJFHtNGKI 


Pf. LSA1 11 


27.0400 


15 


HILYI5FYFILVNLL 


Pf LSA1 3 


27.0402 


15 


UJFHINGKHKNSE 


Pf LSA1 16 


27.0403 


15 


LVNLLIFWINGKIIK 


Pf LSA1 13 


27.0406 


IE 


NLUFH1NGKIIKNS 


Pt LSA1 15 


27.0408 


1 5 


OTNFKSLLRNLGVSE 


Pf LSA1 94 


27.0412 


15 


AYKFVVpGAATPYAG 


Pf SSP2 514 


27.0415 


1 5 


NVKYLVJVFUFFDL 


PI SSPZ 6 


27.0417 


1 5 


VKNVIGPFMKAVCVE 


Pt SSP2 223 


27.0418 


1 5 


WENVKNVIGPFMKAV 


PI SSP2 220 


1186.04 


1 5 


CSWRRAFPHCLAFS 


HBV POL 534 


1186.06 


1 5 


FVQWFVGLSPTVWLS 


HBV ENV 342 


1186.10 


IS 


LAQFTSAICSWRRA 


HBV POL 526 


1186,15 


1 5 


LVPFVQWFVGLSP7V 


HBV ENV 339 


1186. IB 


1 6 


NLSWUSLDVSAAFYH 


HBV POL 422 


11B6.25 


1 5 


SFGVWIRTPPAYRPP 


HBV NUC 121 


1186.26 


1 5 


SPFLLAQFTSAICSV 


HBV POL 5Z2 


1186.27 


1 5 


SSNLSWUSLDVSAAF 


HBV POL 420 


1188.01 


1 5 


OKELTmSNVWwSQT 


Pi L5A1 Bi 


1188.13 


1 s 


• «-st . Mini* i i/i i ^r**v/ 

AG LLG MV5TV LLGG V 


rl CAri oil 


1188.16 


1 6 


KSKY KLATSVLAGLL 


Pf cXPI 71 


1188.3Z 


1 5 


GLAYKFVVPGAATPY 


Pf SSP2 512 


1188.34 


15 


HNWVNHAVPLAMKU 


PI SSP2 62 


1188.35 


IS 


1GPFMKAVCVEVEKT 


Pf SSP2 227 


1188.38 


15 


KYKIAGG1AGGLALL 


PI SSP2 494 


1188.45 


1 5 


RHNWVNHAVPLAMKL 


Pt SSP2 61 


F091.15 


16 


IKOF1NMWQEVGKAMY 


HIV1 ENV 566 


F107.03 


1 5 


LQSLTNLLSSNLSWl 


HBV POL 412 


F107.04 


15 


PFLLAOFTSAJCSW 


HBV POL 523 


F1 07,09 


15 


KYKLATSVLAGLLGN 


Pf EXP1 73 


F1 07.10 


15 


LAGU.GNTVSTVUJGG 


Pf EXP1 81 


F107.11 


15 


RHPFKK3SS0PA0NA 


Pf EXP1 107 


F107.14 


1 5 


ANOLW1LTDGIPDS 


Pt SSP2 153 


F107.17 


15 


KFWPGAATPYAGEP 


Pt SSP2 516 


F1 07.23 


15 


VFNWNSSlGUMVL 


Pf CSP 412 


35.0093 


15 


VGPLTVNEKRRLKU 


HBV POL 96 


35.0096 


15 


ESRLWDFSOF5RGN 


HBV POL 387 


35.0100 


15 


LGQVF AD ATPTGWGL 


HBV POL 683 


35.0106 


15 


VVWATDALMTGYTG 


HCV 1437 


35.0107 


15 


TVDFSLDPTFHETT 


HCV 1466 


35.0125 


15 


AETFYVDGAANRETK 


HIV POL 619 
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Class II Peptides 

Peptide A A Sconce Sourc* 



35 on 2 7 15 EVNIVTDSQYAU3II 

35 0131 1« WAQIKQEFGUPYNPO 

35 0133 1 5 GAWtQDNSDJKWP 

35*0135 15 YRKILRORKIORUD 

35 0171 1 5 POSIQDSIKESRKLN 

35 0172 15 KCNLYADSAWENVKN 

1 280 02 1 S IGTVLVGPTPVNIIG 

1 280 03 1 5 KVYLAWVPAHKGJGG 

1280 04 1S TKELOKOITKIQNFR 

1 280.06 1 5 AGFFLLTR1LTIPOS 

1280.08 15 GFFILTRIL71PQSL 

1 280 09 1 5 GTSFVYVPSALNPAD 

128o!l2 15 IIFLFILLLCUFLL 

1 280. 1 3 1 5 KFAVPNLQSUTNLLS 

1280.15 15 LHLYSHPIILGFRKI 

1280 16 15 UCUFU.VUJDYQG 

1 280 2 1 1 5 VGU-GFAAPFTQCGY 

1280.22 15 FYFILVNLLIFHING 

1 280 23 1 5 KSULRMJGVSENIFL 

1280 25 1 5 RGYYIPHQSSIPODN 

1 283 02 1 S VYLLPRRGPRLGVRA 

1283.10 15 GHRMAWDMMMNWSPT 

1 283.1 1 1 5 CGPVYCFTPSPWVG 

1 283.1 2 1 6 VYCFTPSPWVGTTD 

1283.13 1 5 GNWFGCTWMNSTGFT 

1283.14 1 5 FTTLPALSTGUHLH 

1283.16 1 5 SKGWRLLAPITAYAQ 

1283.17 1 5 DLYIVTHHADVIPVR 

1 283 20 1 S AQGYKVLVLNPSVAA 

1283.21 15 GYKVLVLNPSVAATL 

1283.22 1S VLVLNP5VAATLGFG 
1283.24 1 5 QARLWLATATPPGS 
1283.26 1 5 DWWATDALMTGYT 
1283.30 1 5 FTGLTHIDAHFUSQT 

1 2 83 3 1 15 YLVAYQATVCARAQA 

12B 3.33 15 LEWTSTTVWLVGGVL 

1 283.34 1 5 TWVLVGGVLAALAAY 

1283.36 15 AKHMWNRSGtQYU 

1 283.37 1 5 IOYLAGLSTLPGNPA 
1283.44 15 MNRUAFASRGNHVS 
1283.50 15 SYTWTGAUTPCAAE 
1 283.55 1 5 GSSYGFOYSPGQRVE 
1283.57 15 LEUTSCSSNVSVAH 

1 283.61 1 5 ASCLRKUGVPPIBVW 

1298.02 1 5 VGNFTGLYSSTVPVF 

1298.03 15 TNFLLSLGIHLNPNK 

1298.04 1 5 KCCFRKtPVNRPlCW 

1298.06 1S KQAFTFSPTYKAFLC 

1298.07 1 5 AANW1LRGTSFVYVP 

1298.08 15 PORVHFASPLHVAWR 
1298 10 15 IRPWSTQUJLNGSL 

1 298 1 1 1 5 RSELYKYKWK1EPL 

129B .i3 15 DRFYKTtBAEQASGE 

y 298 16 1 5 JCVILVAVHVASGYIE 

F1 25 02 17 LVNLUFHINGKIIKNS 

F1 25.04 1 6 RHNWVNHAVPLAMKU 



HIV POL 674 
HIV POL 87 d 
HIV POL 989 
HIV VPU 31 
Pf SSP2 165 
Pf SSP2 211 
HIV POL 1 52 
HIV POL 712 
HIV POL 052 
HBV ENV 180 
HBV ENV 181 
HBV POL 774 
HBV ENV 244 
HBV POL 406 
HBV POL 501 
HBV ENV 251 
HBV POL 637 
Pf LSA1 * 
PI LSA1 9B 
Pf LSA1 1669 
HCV Cof 34 
HCV El 315 
HCV NS1/E2 506 
HCV NS1/E2 609 
HCV NS1/E2 550 
HCV NS1/E2 684 
HCV NS3 1025 
HCV NS3 1134 
HCV NS3 1251 
HCV NS3 1253 
HCV NS3 1256 
HCV NS3 1345 
HCV NS3 1436 
HCV NS3 1567 
HCV NS3 1591 
HCV N64 1658 
HCV NS4 1664 
HCV NS4 1787 
HCV NS4 1777 
HCV NS4 1921 
HCV NS5 2456 
HCV NS5 2641 
HCV NS5 2813 
HCV NS5 2930 
HBV POL 53 
HBV POL 568 
HBV POL 615 
HBV POL 661 
HBV POL 764 
HBV POL 824 
HIV1 ENV 333 
HIV1 ENV 637 
HIV1 GAG 333 
HIV1 POL 813 
Pf LSA1 13 
Pf SSP2 61 
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WHAT IS CLAIMED IS: 

1. A composition comprising an isolated peptide that induces a CTL 
response and a T helper peptide comprising a motif of about nine residues wherein the first 
position from the N terminus of the motif is Y, F, W, L, I, V, M and the sixth position 

5 from the N terminus of the motif is S, T, C, A, P, V, I, L, M. 

2. The composition of claim 1 , wherein the T helper peptide consists of 
between about 10 and about 24 residues. 

10 3. The composition of claim 1, wherein the T helper peptide is derived 

from a viral antigen. 

The composition of claim 3, wherein the viral antigen is from HIV, 
The composition of claim 1, wherein the T helper peptide is derived 

6. The composition of claim 5, wherein the antigen is Plasmodium 

20 falciparum. 

7. The composition of claim 1, wherein the peptide that induces a CTL 
response is linked to the T helper peptide. 

25 8. A method of inducing a CTL response in a patient, the method 

comprising contacting a cytotoxic T cell from the patient with an isolated peptide that 
induces a CTL response and a T helper peptide comprising a motif of about nine residues 
wherein the first position from the N terminus of the motif is Y, F, W, L, I, V, M and the 
sixth position from the N terminus of the motif is S, T, C, A, P, V, I, L, M. 



4. 

HBV, or HCV. 

15 

5. 

from a parasite. 
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9. The method of claim 8, wherein the step of contacting is carried out 
by administering to the patient a pharmaceutical composition comprising the nucleic acid 
encoding the peptide that induces a CTL response and the T helper peptide. 

10. The method of claim 8, wherein the the peptide that induces a CTL 
response is linked to the T helper peptide, 

11. A composition comprising a peptide as shown in Table VIII. 

12. A method of inducing a helper T cell response in a patient, the 
method comprising contacting a helper T cell with a peptide of claim 11. 

13. The method of claim 12, wherein the step of contacting is carried 
out by administering to the patient a pharmaceutical composition comprising the peptide. 



14. The method of claim 12, wherein the step of contacting is carried 
out by administering to the patient a pharmaceutical composition comprising a nucleic acid 
encoding the peptide. 
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